Video processing apparatus, video processing method, and program

ABSTRACT

A video processing apparatus includes an input unit configured to input information instructing a focus position, in a first shooting, an acquisition unit configured, in a case in which the information instructing a focus position has been input by the input unit, to acquire position information indicating the instructed focus position, and time information indicating a time at which the focus position has been instructed, and a control unit configured to control a focus position in a second shooting based on the position information and the time information that have been acquired by the acquisition unit.

BACKGROUND OF THE INVENTION

Field of the Invention

The present invention relates to a video processing apparatus, a method,and a program for performing processing for putting a focus on asubject.

Description of the Related Art

In the production of content such as films, in some cases, videos ofrespective scenes constituting video content are each formed of aplurality of shot videos (short videos each serving as the minimum unitof shooting). In addition, when shooting of each shot video of aspecific scene video is performed, shooting is repetitively performed insimilar situations, obtained shot videos are compared, and a shot videoto be finally used is selected. The specific scene video can be therebygenerated.

In addition, there has been known the following method of putting afocus on a subject in such shooting. More specifically, a desired focusposition is prestored as a preset position, and a focus is put on thestored preset position according to a predetermined preset reproductioninstruction issued by an operator or the like (Japanese PatentApplication Laid-Open No. 8-334674).

In the method described in the above-described Japanese PatentApplication Laid-Open No. 8-334674, a focus cannot be put on a subjectexisting at a position different from the prestored preset position(focus position) at an appropriate timing. In other words, for putting afocus on a subject existing at a position different from the presetposition, a user needs to perform focus adjustment by a manual operationafter preset reproduction. Thus, works bothersome for the user have beenrequired.

SUMMARY OF THE INVENTION

For controlling a focus position in a second shooting according to afocus position instructed in a first shooting and a timing at which theinstruction has been received, for example, a video processing apparatusincludes the following configurations.

More specifically, a video processing apparatus includes an input unitconfigured to input information instructing a focus position, in a firstshooting, an acquisition unit configured, in a case in which theinformation instructing a focus position has been input by the inputunit, to acquire position information indicating the instructed focusposition, and time information indicating a time at which the focusposition has been instructed, and a control unit configured to control afocus position in a second shooting based on the position informationand the time information that have been acquired by the acquisitionunit.

Further features of the present invention will become apparent from thefollowing description of exemplary embodiments (with reference to theattached drawings).

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram illustrating a configuration example of avideo processing apparatus according to a first exemplary embodiment.

FIGS. 2A and 2B are diagrams illustrating a structure of video content.

FIG. 3 is a flowchart illustrating processing for storing a focusposition in rehearsal shooting according to the first exemplaryembodiment.

FIG. 4 is a flowchart illustrating focusing processing in real shootingaccording to the first exemplary embodiment.

FIG. 5 is a diagram illustrating an example of transition of a standposition of a subject in a studio set.

FIGS. 6A to 6C are diagrams illustrating a display screen in rehearsalshooting according to the first exemplary embodiment.

FIGS. 7A to 7C are diagrams illustrating a display screen in realshooting according to the first exemplary embodiment.

FIG. 8 is a block diagram illustrating a configuration example of avideo processing apparatus according to a second exemplary embodiment.

FIG. 9 is a flowchart illustrating focusing processing in real shootingaccording to the second exemplary embodiment.

FIGS. 10A to 10C are diagrams illustrating a display screen in realshooting according to the second exemplary embodiment.

FIG. 11 is a block diagram illustrating a configuration example of avideo processing apparatus according to a third exemplary embodiment.

FIG. 12 is a flowchart illustrating focusing processing in real shootingaccording to the third exemplary embodiment.

FIGS. 13A to 13C are diagrams illustrating a display screen in realshooting according to the third exemplary embodiment.

FIG. 14 is a block diagram illustrating a hardware configuration exampleof a computer applicable to a video processing apparatus.

FIG. 15 is a flowchart illustrating another example of focusingprocessing in real shooting according to the third exemplary embodiment.

DESCRIPTION OF THE EMBODIMENTS

An example of exemplary embodiments of the present invention will bedescribed below in detail with reference to the appended drawings. Inaddition, the configurations described in the following exemplaryembodiments are mere examples, and the present invention is not limitedto the configurations illustrated in the drawings. In video dataincluding 1 or more images, video data corresponding to 1 shot, which isthe minimum unit of shooting, will be hereinafter referred to as a shotvideo, and video data including 1 or more shot videos will behereinafter referred to as video content. In addition, the details ofthe shot video and the video content will be described, later using FIG.2.

First Exemplary Embodiment

A configuration of a video processing apparatus according to the presentexemplary embodiment will be described using FIG. 1. FIG. 1 is a diagramillustrating a configuration example of a video processing apparatusaccording to the present exemplary embodiment. The video processingapparatus according to the present exemplary embodiment is an imagingapparatus including an imaging unit. The video processing apparatus,however, is not limited to this. A video processing apparatus 100according to the present exemplary embodiment captures an image of asubject to generate video data. A lens 102 constitutes an imagingoptical system. An imaging unit 114 is an imaging unit for capturing animage of a subject, and includes a sensor for generating video data byconverting light imaged through the lens 102, into electric charge. Adisplay unit 103 displays an image (a video) captured by the imagingunit 114. A display control unit 104 controls the display unit 103 todisplay a video.

An input unit 106 receives an operation performed by an operator (auser), and acquires instruction information that is based on thereceived operation. For example, the input unit 106 detects a touchoperation performed on the display unit 103 by the user, and acquiresinformation instructing a focus position (position on a display screenon which a focus is to be put). In addition, in the present exemplaryembodiment, the display unit 103 receives an operation performed by theuser, and the input unit 106 acquires instruction information that isbased on the operation performed on the display unit 103 by the user.The configuration, however, is not limited to this. In other words, thedisplay unit 103 may have the function of the input unit 106.

An instruction unit 105 acquires, from the input unit 106, informationinstructing a focus position, and based on the information instructing afocus position, acquires information indicating a touched position(touch position) on a display screen of the display unit 103. Theinstruction unit 105 then outputs the acquired information to aprocessing unit on a subsequent stage. Here, the touch position refersto a position instructed by the user as a focus position (position onthe display screen on which a focus is to be put).

Based on the information indicating the touch position that has beenacquired from the instruction unit 105, a focus control unit 107 drivesthe lens 102 (performs focusing processing) so that a focus is put on asubject corresponding to the touch position. In addition, in the presentexemplary embodiment, a method used by the focus control unit 107 is notlimited to a method used for adjusting a focus, and the focus controlunit 107 may use any method as long as the method can perform focusingprocessing with respect to a specific position (instructed position) ona shooting screen. For example, the focus control unit 107 can adjust afocus (perform automatic focusing processing) using a phase differencedetection method, a contrast detection method, and the like.Alternatively, the focus control unit 107 may measure a distance from animaging position to a subject corresponding to a touch position, andperform focusing processing based on the distance. Yet alternatively,the focus control unit 107 may perform focusing processing using a tableindicating a relationship between a phase difference between a pluralityof light beams input via the lens 102, and a driving amount of the lens102.

A microphone 108 takes in audio as an electrical signal to generateaudio information (an audio signal). The microphone 108 then outputs thegenerated audio information to a detection unit 109. Based on the audioinformation output from the microphone 108, the detection unit 109detects, by voice recognition, an action call (predeterminedidentification information), which is an acting start command(instruction) issued by the user (shooting commander, director, etc.).In addition, the detection unit 109 acquires a timing at which theaction call has been detected (time information). A time measurementunit 110 measures a time using the time point at which the action callhas been detected by the detection unit 109, as a starting point. Basedon the time measured by the time measurement unit 110, a storage unit111 records (stores) information indicating an instructed focus position(touch position) and information indicating a time at which informationinstructing a focus position has been acquired (touch time).

When the shooting of a shot video similar to an already-shot shot video(another take of the same shot) is performed, a shooting instructionunit 112 outputs, to an overall control unit 101, retake informationindicating that similar shooting is to be performed. A coordinateacquisition unit 113 acquires (reads, calls) position information(information indicating a touch position) stored in the storage unit111.

The overall control unit 101 controls the units in the video processingapparatus 100, and transmits parameters between the units. In FIG. 1,lines indicating connections between the overall control unit 101 andthe other units in the video processing apparatus 100 are omitted. Inaddition, the overall control unit 101 can control the units in thevideo processing apparatus 100 and read/write parameters between theunits, through at least either a parameter signal line (not illustrated)or a register bus (not illustrated). In addition, in the presentexemplary embodiment, the overall control unit 101 in FIG. 1 isinstalled within the video processing apparatus 100. The installation ofthe overall control unit 101, however, is not limited to this. In otherwords, the overall control unit 101 may be installed outside the videoprocessing apparatus 100. In addition, the overall control unit 101 ofthe video processing apparatus 100 may control the units in the videoprocessing apparatus 100 and read/write parameters between the units,through at least either a parameter signal line or a register bus thatrelays between the overall control unit 101 and the video processingapparatus 100.

Next, the details of video content and a shot video will be describedusing FIG. 2A. FIG. 2A is a diagram illustrating an example of ahierarchical concept of video data, which is video content including 1or more shot videos, and is video data for creating video content suchas cinema videos and drama videos. Video content that is based on ascript or a scenario is formed of 1 or more sequences. One sequencevideo is formed of 1 or more scenes. Furthermore, 1 scene is formed of 1or more shots. In addition, the video processing apparatus 100 accordingto the present exemplary embodiment uses 1 shot as a unit of 1 shooting.

For example, a scene in which 2 people (persons A and B) are talkingwith each other can be shot by various shooting methods. Shots differentin shooting position and angle are shot. For example, such shots includea long shot obtained by shooting the talking 2 people in a wide angle, acloseup shot obtained by shooting the person B up close, a shoulder shotobtained by shooting the person A over the shoulder of the person B.Shots and scenes are not always shot in the order of a script or ascenario (reproduction order of video content), and in some cases, theyare shot in an order different from the reproduction order of the videocontent for the convenience of an imaging apparatus or a shooting set.In addition, shooting of 1 shot may be retaken many times due toinfluences such as an error of an actor or a staff, the intention of adirector, and unpredictable natural phenomenon. In this manner, shootingof the same shot will be referred to as a “take”, and if the shooting ofthe same shot is performed a plurality of times, a number called “take”is allocated to each shot video for identifying each shooting.

More specifically, by adding, as shot data, shooting information such asthe numbers of a sequence, a scene, a shot, and a take to each file ofeach shot video included in video content, each shot video can bespecified (identified). For example, in FIG. 2A, a video 201 isrepresented by a sequence 1, a scene 1, a shot 2, and a take 4, and avideo 202 is represented by a sequence 1, a scene 1, a shot 4, and atake 3. In addition, if there is a plurality of takes having the sameshot number, these takes are used as replaceable takes in the creationof video content. Such information indicating a take is added to a shotvideo by shooting a board on which the take is written, or added by theuser manually inputting the take. In addition, the terms such as“sequence”, “scene”, “shot”, and “take” described here are mere examplesfor describing the present exemplary embodiment, and different terms maybe used.

In addition, FIG. 2B is a diagram illustrating time axis coincidence(synchronization) of a plurality of takes of the same shot. Since eachtake of the same shot is a retake of the same shot, the time lengths ofvideos are substantially the same in many cases. In addition, the takesof the same shot can be replaced with one another when being edited asvideo content. Nevertheless, in actual shooting, times from the shootingstart to the beginning of a video of a portion used for video contentmay vary among shot videos of the respective takes. In such a case, byusing a timing of an action call (predetermined identificationinformation), which is an acting start command (instruction) issued by adirector, as time information indicating an acting start time,synchronization processing can be performed on the plurality of takes ofthe same shot. In other words, the video processing apparatus 100according to the present exemplary embodiment matches (synchronizes) thetimings of respective action calls in the plurality of takes of the sameshot. In addition, the action call timing is not limited to the timingat which an acting start instruction is issued by the user, and may betimings at which various instructions are issued by the user. Inaddition, for the shooting of a shot video without action such as videoonly including scenery, the timing of a roll call indicating a shootingstart may be used as an action call timing. In addition, the roll calltiming may be a timing at which a shooting start instruction is issuedby the user, or may be a timing at which the imaging unit 114 startsshooting processing. In addition, in the present exemplary embodiment,the time information is information indicating the time of the beginningof a video. The time information, however, is not limited to this, andmay be any information as long as the information indicates apredetermined time serving as a reference.

In the specific example illustrated in FIG. 2B, times from roll callsindicating shooting starts to action calls indicating acting starts oftakes 1 to 6 are not uniform. Nevertheless, because an actor startsacting in response to an action call, synchronization processing can beperformed on the takes based on time information indicating the timingof the action call. The video processing apparatus 100 according to thepresent exemplary embodiment can perform synchronization processing bymatching the time axes of videos of the plurality of takes of the sameshot using the time information indicating the action call, as areference time information 00:00, for example.

Next, the procedure of processing for storing an instructed focusposition that is performed by the video processing apparatus 100according to the present exemplary embodiment will be described indetail using FIGS. 3, 5, and 6A to 6C. In addition, the video processingapparatus 100 according to the present exemplary embodiment executes theprocessing for storing an instructed focus position (processing in FIG.3), when performing rehearsal shooting. The execution timing, however,is not limited to this, and the video processing apparatus 100 mayexecute the processing for storing an instructed focus position, whenperforming real shooting (e.g., when performing shooting of the firsttake of the same shot).

A case of shooting a shot video using a studio set illustrated in FIG. 5will be described below. In addition, FIG. 5 is a diagram illustratingan example of transition of a subject in a studio set, which is ashooting location. In addition, FIG. 5 is a diagram illustrating a casein which a studio set 501 simulating a room is viewed from a ceiling. Aposition 502 is a position at which shooting is performed (position atwhich the video processing apparatus 100 illustrated in FIG. 1 is set).The studio set 501 is provided with a door 503. A position 504 indicatesa stand position of an actor A. A position 505 indicates a position atwhich an actor B has stopped after opening the door 503 in rehearsalshooting. In addition, a position 506 indicates a position at which theactor B has stopped after opening the door 503 in real shooting.

In the present exemplary embodiment, when shooting is started, the actorA is shot in waist shot. Furthermore, an operation corresponding to anaction call is performed, and after 5 seconds from the action call, theactor B opens the door 503 to enter the room. In addition, when theoperation corresponding to the action call is performed, a focus is puton the actor A. Next, after 5 seconds from the action call, a focus isput on the actor B that has entered the room.

FIGS. 6A, 6B, and 6C are diagrams illustrating a display screen (anoperation screen) in the display unit 103 of the video processingapparatus 100 that is displayed when an instructed focus position isstored. A display screen 601 is a display screen (an operation screen)in the display unit 103. In addition, FIGS. 6A, 6B, and 6C correspond tothe lapse of time in shooting. In FIGS. 6A, 6B, and 6C, solid linesrepresents outlines of in-focus portions, and dotted lines representsoutlines of out-of-focus portions.

FIG. 6A illustrates an image displayed in the display unit 103 when theuser performs an operation corresponding to an action call (when thedetection unit 109 detects the action call). When the operationcorresponding to the action call is performed, as illustrated in FIG.6A, on the display screen 601 of the display unit 103, the actor A isdisplayed at a position 604, and an entrance 602 provided on the studioset is displayed. In addition, as illustrated in FIG. 6A, when theoperation corresponding to the action call is performed, a focus is puton the position 604 of the actor A.

FIG. 6B illustrates an image displayed in the display unit 103 after 5seconds from the operation corresponding to the action call. After apredetermined time from the operation corresponding to the action call,the actor B enters the studio set from the entrance 602. In addition,after 5 seconds from the operation corresponding to the action call, asillustrated in FIG. 6B, on the display screen. 601, the actor B isdisplayed at a position 605, and the entrance 602 and a door 603 openedby the actor B are displayed. In addition, on the display screen 601,similarly to that in FIG. 6A, the actor A is displayed at the position604. In addition, as illustrated in FIG. 6B, after 5 seconds from theoperation corresponding to the action call, a focus is put on theposition 604 of the actor A. Meanwhile, the focus is not put on theposition 605 of the actor B, the entrance 602, and the door 603 sincethey are at back positions distant from the position 604 of the actor A.At this time, the user touches a position 606 on the display screen 601for putting a focus on the position 605 of the actor B. In addition, asillustrated in FIG. 6B, the display control unit 104 displays a circulargraphic (cursor mark: hereinafter, simply referred to as a cursor) at afocus position (the position 606) on the display screen 601 of thedisplay unit 103 that has been instructed by the user.

FIG. 6C illustrates an image displayed in the display unit 103 after thefocusing processing for putting a focus on the position 606 illustratedin FIG. 6B is performed. As illustrated in FIG. 6C, focus is put on theposition 605 of the actor B. In addition, the focus is also put on theentrance 602 and the door 603 since the positions of the entrance 602and the door 603 are within a predetermined range from (in vicinity of)the position 606. In addition, the focus is not put on the actor A sincethe position 604 of the actor A is on the outside of the predeterminedrange (distant) from the position 606.

Next, FIG. 3 is a flowchart illustrating the details of processing forstoring a focus position instructed by the user that is performed by thevideo processing apparatus 100 according to the present exemplaryembodiment.

In step S301, the video processing apparatus 100 acquires informationindicating a shooting start that is provided by, for example, the usertouching a predetermined position on the display unit 103. Then, theimaging unit 114 according to the present exemplary embodiment startsshooting based on the acquired information indicating a shooting start.

In step S302, the video processing apparatus 100 acquires a timing atwhich an action call has been detected (time information of an actioncall). The detection unit 109 according to the present exemplaryembodiment detects an action call instructed by the user, based on audioinformation (voice of the user, specific sound, etc.) output from themicrophone 108. Then, the detection unit 109 acquires a timing at whichthe action call has been detected (time information, of an action call).In addition, a method for detecting time information of an action callis not limited to a method that is based on audio information. A methodof detecting time information of an action call in response to a button(not illustrated) corresponding to the action call being pressed by theuser may be used. Alternatively, a method of detecting time informationof an action call in response to the recognition of a gesturecorresponding to the action call that is performed by the user may beused. In addition, when an action call is detected in step S302 of thepresent exemplary embodiment, as illustrated in FIG. 6A, a state inwhich a focus is put on the actor A is obtained.

In step S303, based on the time information of the action call that hasbeen detected by the detection unit 109 in step S302, the timemeasurement unit 110 starts time measurement from a time point at whichthe action call has been detected. In other words, the detection unit109 according to the present exemplary embodiment uses the time point atwhich the action call has been detected, as a start time point of timemeasurement (time point 00:00 illustrated in FIG. 2B).

In step S304, the video processing apparatus 100 determines whetherinformation instructing a focus position has been acquired (detected).In addition, the input unit 106 according to the present exemplaryembodiment can acquire information instructing a focus position if thedisplay unit 103 receives an operation of instructing a focus positionthat is performed by the user. If it is determined in step S304 thatinformation instructing a focus position has been acquired (YES in stepS304), the video processing apparatus 100 advances the processing tostep S305. On the other hand, if it is determined in step S304 thatinformation instructing a focus position has not been acquired. (No instep S304), the video processing apparatus 100 advances the processingto step S307.

In step S305, the video processing apparatus 100 performs focusingprocessing based on the information instructing the focus position thathas been acquired in step S304. More specifically, the input unit 106according to the present exemplary embodiment outputs the informationinstructing the focus position, to the instruction unit 105. Then, basedon the information instructing the focus position, the instruction unit105 acquires information indicating a touch position, and outputs theacquired information to the focus control unit 107. Here, the touchposition is position corresponding to the information instructing thefocus position, and is a focus position on the display screen 601 of thedisplay unit 103 that has been touched (instructed) by the user. Then,based on the information indicating the touch position, the focuscontrol unit 107 performs processing for driving the lens 102 so that afocus is put on a subject existing at the touch position (focusingprocessing). Through the processing, a focus can be put on a subjectcorresponding to a touch position on a screen of the display unit 103 ofthe video processing apparatus 100. In addition, when informationinstructing a focus position is acquired in the present exemplaryembodiment, as illustrated in FIG. 6B, a cursor is displayed at theinstructed position 606 on the display screen 601 of the display unit103.

In step S306, the storage unit 111 stores the information indicating theinstructed focus position (touch position) and information indicating atiming at which the information instructing the focus position has beenacquired (touch time). More specifically, the storage unit 111 storesinformation indicating the touch position that has been output from theinstruction unit 105, and information indicating the touch time that hasbeen output from the time measurement unit 110. Here, the touch timerefers to a time from a time measurement start in step 303 to wheninformation instructing a focus position is acquired in step S304. Inother words, the video processing apparatus 100 according to the presentexemplary embodiment stores the position 606 illustrated in FIG. 6B, asinformation indicating a touch position, and stores a time from anaction call to when the position 606 is instructed by the user (5seconds in this example), as information indicating a touch time. Inaddition, the video processing apparatus 100 according to the presentexemplary embodiment stores information indicating a touch position andinformation indicating a touch time in association with each other.

In step S307, the video processing apparatus 100 determines whethershooting has ended. If it is determined in step S307 that shooting hasnot ended. (No in step S307), the video processing apparatus 100 returnsto the processing in step S304. On the other hand, if it is determinedin step S307 that shooting has ended (YES in step S307), the videoprocessing apparatus 100 ends the processing for storing an instructedfocus position.

In the above-described manner, the video processing apparatus 100according to the present exemplary embodiment can store an instructedfocus position. In addition, after the shooting has ended in step S307,the video processing apparatus 100 may save a file obtained by compilinginformation stored in step S306 for 1 shot video i.e., for 1 rehearsal,1 take), and the like into the storage unit 111 or another memory (notillustrated).

Next, the procedure of focusing processing that is executed by the videoprocessing apparatus 100 according to the present exemplary embodimentbased on a stored focus position will be described in detail using FIGS.4 and 7A to 7C. In addition, the video processing apparatus 100according to the present exemplary embodiment executes, in real shooting(the first take of the same shot), the focusing processing that isexecuted based on a stored focus position (processing in FIG. 4). Theexecution timing, however, is not limited to this, and the videoprocessing apparatus 100 is only required to execute the focusingprocessing (processing in FIG. 4) based on a stored focus position afterthe processing for storing an instructed focus position (processing inFIG. 3). For example, if processing for storing an instructed focusposition is executed in the shooting of the first take of the same shot,the focusing processing that is based on a stored focus position isexecuted in the shooting of the second and subsequent takes. Inaddition, similarly to the case of performing the processing illustratedin FIG. 3, a shot video is shot using the studio set illustrated in FIG.5.

First, FIGS. 7A, 7B, and 7C are diagrams illustrating the display screen601 in the display unit 103 of the video processing apparatus 100 thatis displayed when the focusing processing is performed based on a storedfocus position. In addition, FIGS. 7A, 7B, and 7C correspond to thelapse of time in shooting. In addition, in FIGS. 7A, 7B, and 7C, solidlines represents outlines of in-focus portions, and dotted linesrepresents outlines of out-of-focus portions.

FIG. 7A illustrates an image displayed in the display unit 103 when theuser performs an operation corresponding to an action call (when thedetection unit 109 detects the action call). As illustrated in FIG. 7A,when the operation corresponding to the action call is performed, afocus is put on a position 704 of the actor A.

FIG. 7B illustrates an image displayed in the display unit 103 after 5seconds from the operation corresponding to the action call. Asillustrated in FIG. 7B, on the display screen 601 of the display unit103, the actor B is displayed at a position 705, and the entrance 602and the door 603 opened by the actor B are displayed. In addition, onthe display screen 601, similarly to that in FIG. 7A, the actor A isdisplayed at the position 704. In addition, as illustrated in FIG. 7B,after 5 seconds from the operation corresponding to the action call,similarly to rehearsal shooting, the focus is not put on the position705 of the actor B, the entrance 602, and the door 603 since they are atback positions distant from the position 704 of the actor A. Inaddition, as illustrated in FIG. 7B, after 5 seconds from the operationcorresponding to the action call, a cursor is displayed at the position606 corresponding to information indicating a touch position that hasbeen read in real shooting.

FIG. 7C illustrates an image displayed in the display unit 103 after thefocusing processing for putting a focus on the position 606 illustratedin FIG. 7B is performed. As illustrated in FIG. 7C, a focus is put onthe position 705 of the actor B. In addition, the focus is not put onthe actor A since the position 704 of the actor A is on the outside ofthe predetermined range (distant) from the position 606.

Next, FIG. 4 is a flowchart illustrating the details of the focusingprocessing that is executed by the video processing apparatus 100according to the present exemplary embodiment based on a stored focusposition.

In step S401, the video processing apparatus 100 acquires informationindicating a touch time that is stored in the storage unit 111. In thepresent exemplary embodiment, when the focusing processing that is basedon a stored focus position is started, the shooting instruction unit 112outputs, to the overall control unit 101, retake information indicatingthat shooting similar to shooting in which processing for storing aninstructed focus position (processing in FIG. 3) has been performed isto be performed. Then, the overall control unit 101 acquires informationindicating a touch time that is stored in the storage unit 111. In thisexample, a touch time stored in step S306 in FIG. 3 (5 seconds) isacquired.

In addition, the video processing apparatus 100 according to the presentexemplary embodiment performs real shooting corresponding to rehearsalshooting in which the processing in FIG. 3 has been performed. Thus, thevideo processing apparatus 100 acquires the information stored in thestorage unit 111 in the rehearsal shooting (information indicating atouch position and/or a touch time). Nevertheless, information to beacquired is not limited to this. When the processing in FIG. 3 isperformed in the shooting of a first video (for example, take 1), andthe processing in FIG. 4 is performed in the shooting of a second video(for example, take 2) of the same shot as the first video, the videoprocessing apparatus 100 is only required to acquire information storedin the shooting in which the processing in FIG. 3 has been performed.

In addition, if the processing in FIG. 3 is performed in the shootingsof a plurality of videos (for example, takes 1 and 2 of the same shot),and then the processing in FIG. 4 is performed in the shooting of avideo (for example, take 3) of the same shot, the video processingapparatus 100 according to the present exemplary embodiment performs thefollowing processing. More specifically, the video processing apparatus100 according to the present exemplary embodiment acquires informationstored in the last shooting (shooting of take 2) out of shootings of thevideos (takes 1 and 2) in which the processing in FIG. 3 has beenperformed. In addition, the configuration of the video processingapparatus 100 according to the present exemplary embodiment is notlimited to this. The video processing apparatus 100 may be configuredto, if the processing in FIG. 3 is performed in the shootings of aplurality of videos, store a plurality of pieces of information storedin the shootings of the plurality of videos, and to select informationfrom among plurality of pieces of stored information when performing theprocessing in FIG. 4.

The processes in steps S402 to S404 are processes similar to therespective processes in steps S301 to 303 in FIG. 3. Thus, thedescription thereof will be omitted. In addition, when an action call isdetected in step S403 of the present exemplary embodiment, the imageillustrated in FIG. 7A is displayed on the display screen. 601. At thistime, as illustrated in FIG. 7A, a focus is put on the actor A.

In step S405, the video processing apparatus 100 compares a timemeasured by the time measurement unit 110, with a touch time acquired instep S401. In this step, if it is determined that the time measured bythe time measurement unit 110 does not match the touch time acquired instep S401 (No in step S405), the video processing apparatus 100 advancesthe processing to step S408, in which the video processing apparatus 100determines whether shooting has ended.

On the other hand, if it is determined in step S405 that the timemeasured by the time measurement unit 110 matches the touch timeacquired in step S401 (YES in step S405), the video processing apparatus100 advances the processing to step S406. The video processing apparatus100 according to the present exemplary embodiment determines that thetime measured by the time measurement unit 110 matches the touch timeacquired in step S401, when the time measured by the time measurementunit 110 is 5 seconds (i.e., 5 seconds after the action call). Then, thevideo processing apparatus 100 advances the processing to step S406.

In step S406, the video processing apparatus 100 acquires informationindicating a touch position that is stored in the storage unit 111. Morespecifically, the coordinate acquisition unit 113 according to thepresent exemplary embodiment acquires (reads, calls) informationindicating a touch position that is saved in the storage unit 111. Inaddition, the information indicating a touch position is informationcorresponding to the information indicating a touch time that has beenacquired in step S401. In this example, a touch position (the position606) stored in step S306 in FIG. 3 is acquired. Then, the coordinateacquisition unit 113 outputs the acquired information indicating a touchposition, to the instruction unit 105. Furthermore, the instruction unit105 outputs the information indicating a touch position, to the focuscontrol unit 107. In addition, when the information indicating a touchposition has been acquired in step S406 of the present exemplaryembodiment, as illustrated in FIG. 7B, the video processing apparatus100 displays a cursor at the position. 606 on the display screen 601 ofthe display unit 103 that corresponds to the information indicating atouch position.

In step S407, the video processing apparatus 100 performs focusingprocessing based on the information indicating a touch position that hasbeen acquired from the storage unit 111 in step S406. More specifically,the focus control unit 107 according to the present exemplary embodimentperforms processing for driving the lens 102 so that a focus is put onan actual subject corresponding to the touch position acquired from theinstruction unit 105 (focusing processing). In other words, according toan actual distance between the imaging unit 114 and the actual subject,the focus control unit 107 performs focusing processing while performingadjustment so that a focus is put on the subject. At this time, if theposition of the subject is shifted from the acquired touch position,adjustment is performed so that a focus is put on the position of theactual subject, and then, the focusing processing is performed. Throughthe processing, a focus can be put on a subject corresponding to a touchposition on a screen of the display unit 103 of the video processingapparatus 100

In step S408, similarly to the processing in step S307 in FIG. 3, thevideo processing apparatus 100 determines whether shooting has ended.Then, if it is determined in step S408 that shooting has not ended (Noin step S408), the video processing apparatus 100 returns to theprocessing in step S405. On the other hand, if it is determined in stepS408 that shooting has ended (YES in step S408), the video processingapparatus 100 ends the focusing processing that is based on a storedfocus position.

Through the processing in FIG. 4, the video processing apparatus 100according to the present exemplary embodiment can put a focus on theactor B even if a stand position of the actor B in real shooting (theposition 506 in FIG. 5) differs from a stand position of the actor B inrehearsal shooting (the position 505 in FIG. 5). In other words, asillustrated in FIG. 5, even if the distances from a shooting position(the position 502) are different (the position 506 is on the rear sideof the position 505), in step S407, focusing processing is performedwith respect to the touch position (the position 606) on the displayscreen 601. The video processing apparatus 100 according to the presentexemplary embodiment can thereby put a focus on the actor B.

As described above, according to the video processing apparatus of thepresent exemplary embodiment, a focus can be put on a subject existingat a position different from a prestored focus position.

In addition, after the video processing apparatus 100 according to thepresent exemplary embodiment acquires a touch time in step S401 in FIG.4, the video processing apparatus 100 acquires a touch position in stepS406. The acquisition timing, however, is not limited to this. In otherwords, the video processing apparatus 100 is only required to acquire atouch position before performing focusing processing in step S407. Forexample, the video processing apparatus 100 may acquire a touch time anda touch position in step S401.

In conventional focusing processing, a distance between a position of animaging apparatus for performing shooting and a position at which asubject is planned to stand is measured in advance, and a focus positionthat is based on the measured distance is stored as a preset position.In contrast to this, the video processing apparatus of the presentexemplary embodiment stores information indicating an instructed focusposition (touch position) and a time at which the informationinstructing the focus position has been acquired (touch time) in advance(in rehearsal shooting). In other words, the video processing apparatusof the present exemplary embodiment stores a focus position on ashooting screen instead of distance information. With thisconfiguration, the video processing apparatus of the present exemplaryembodiment can appropriately perform focusing processing even if theposition of a subject varies when shooting is performed again (in realshooting). Furthermore, the video processing apparatus of the presentexemplary embodiment can appropriately perform focusing processing evenwithout a measurement device used for measuring a distance inconventional focusing processing.

In addition, in some techniques for performing conventional focusingprocessing, a plurality of focus positions is prestored as presetpositions, and focusing processing is performed based on a presetposition selected from among the plurality of stored preset positions.On the other hand, the video processing apparatus of the presentexemplary embodiment stores a time at which information instructing afocus position has been acquired (touch time), in association with theinstructed focus position (touch position). Through the processing, thevideo processing apparatus of the present exemplary embodiment can reada touch position at an appropriate timing, and perform focusingprocessing. With this configuration, if a plurality of touch positionsis stored, the video processing apparatus the present exemplaryembodiment can appropriately perform focusing processing by using atouch time, without receiving, from the user, an operation of selectinga touch position to be used, from among the plurality of touchpositions.

Second Exemplary Embodiment

The above-described video processing apparatus 100 according to thefirst exemplary embodiment performs the following processing wheninformation instructing a focus position is input during shooting. Morespecifically, the video processing apparatus 100 according to the firstexemplary embodiment stores information indicating an instructed focusposition (touch position) and information indicating a timing at whichthe information instructing a focus position has been acquired (touchtime). Then, when performing subsequent shooting (for example, similarshooting), the video processing apparatus 100 according to the firstexemplary embodiment reads the stored information indicating a touchposition and information indicating a touch time, and performs focusingprocessing for putting a focus on the touch position corresponding tothe touch time. A video processing apparatus 100 according to the secondexemplary is configured to, when performing similar shooting, performobject detection with respect to a range within a predetermined distancefrom (in vicinity of) a touch position corresponding to a touch time,and perform focusing processing based on a detection result.

FIG. 8 is a diagram illustrating a configuration example of a videoprocessing apparatus according to the present exemplary embodiment. Inaddition, in FIG. 8 of the present exemplary embodiment, the unitshaving the same functions as the respective units in FIG. 1 of the firstexemplary embodiment are assigned the same reference numerals, and thedescription thereof will be omitted. The video processing apparatus 100according to the present exemplary embodiment differs from the videoprocessing apparatus 100 according to the first exemplary embodimentthat is illustrated in FIG. 1, in that an object detection unit 801 isincluded. The object detection unit 801 acquires an image (a video)captured by the imaging unit 114, and detects an object from a rangewithin a predetermined distance from a touch position (predeterminedrange around a touch position) in the acquired image.

In addition, in the present exemplary embodiment, the object detectionunit 801 detects a human body as an object, but the type of an object tobe detected is not limited to this. For example, the object detectionunit 801 may detect a human body, a face, a substance, a character, aregion with a specific shape or a specific color, or the like, as anobject. In addition, as a method for detecting an object from an image,for example, there is a method of extracting a feature point from animage, a method of detecting a motion using a plurality of images, andthe like. The detection method is not limited to these methods. Forexample, when the type of an object to be detected is a face, the objectdetection unit 801 can detect a face by extracting, from an image,feature points, outlines, or the like that correspond to eyes, a nose, amouse, and the like. In addition, the input unit 106 may acquire thetype of an object to be detected by the object detection unit 801, byreceiving an operation performed by the user, and the object detectionunit 801 may perform object detection based on the acquired type of anobject.

The video processing apparatus 100 according to the present exemplaryembodiment performs processing for storing an instructed focus position,similarly to the processing in the first exemplary embodiment that isillustrated in FIGS. 3 and 6A to 6C. Thus, the description thereof willbe omitted.

Next, the procedure of focusing processing that is executed by the videoprocessing apparatus 100 according to the present exemplary embodimentbased on a stored focus position will be described in detail using FIGS.9 and 10A to 10C. In addition, the video processing apparatus 100according to the present exemplary embodiment executes, in real shooting(the first take of the same shot), the focusing processing that isexecuted based on a stored focus position (processing in FIG. 9). Theexecution timing, however, is not limited to this, and the videoprocessing apparatus 100 is only required to execute the focusingprocessing (processing in FIG. 9) based on a stored focus position afterthe processing for storing an instructed focus position (processing inFIG. 3). For example, if processing for storing an instructed focusposition is executed in the shooting of the first take of the same shot,the focusing processing that is based on a stored focus position isexecuted in the shooting of the second and subsequent takes. Inaddition, similarly to the case of performing the processing illustratedin FIG. 3, a shot video is shot using the studio set illustrated in FIG.5.

First, FIGS. 10A, 10B, and 10C are diagrams illustrating the displayscreen 601 in the display unit 103 of the video processing apparatus 100that is displayed when the focusing processing is performed based on astored focus position. In addition, FIGS. 10A, 10B, and 10C correspondto the lapse of time in shooting. In addition, in FIGS. 10A, 10B, and10C, solid lines represents outlines of in-focus portions, and dottedlines represents outlines of out-of-focus portions.

FIG. 10A illustrates an image displayed in the display unit 103 when thevideo processing apparatus 100 reads a touch position. In other words,FIG. 10A illustrates an image displayed in the display unit 103 when atime corresponding to a read touch time has elapsed from when anoperation corresponding to an action call has been detected. Asillustrated in FIG. 10A, on the display screen 601 of the display unit103, the actor B is displayed at a position 1005, and the entrance 602and the door 603 opened by the actor B are displayed. In addition, onthe display screen 601, the actor A is displayed at a position 1004.Furthermore, as illustrated in FIG. 10A, a cursor is displayed on thedisplay screen 601 at the position 606 corresponding to informationindicating a touch position. In addition, in the example illustrated inFIG. 10A, a focus is not put on the actor B since the position 1005 ofthe actor B is on the outside of a predetermined range from the touchposition (the position 606).

FIG. 10B is a diagram illustrating the display screen 601 in the displayunit 103 of the video processing apparatus 100 that is displayed when anobject is detected in a range within a predetermined distance from (invicinity of) the touch position (the position 606). In addition, a frame1006 indicates a range which the object detection unit 801 has detectedan object (the actor B).

FIG. 10C illustrates an image displayed in the display unit 103 afterfocusing processing has been performed based on an object detectionresult. As illustrated in FIG. 10C, a focus is put on the position 1005of the actor B. In addition, the position 1004 of the actor A is in arange larger than the predetermined distance (distant) from the position606, and on the outside of an object detection target range. Thus,focusing processing is not performed with respect to the actor A, and afocus is not put on the actor A.

Next, FIG. 9 is a flowchart illustrating the details of the focusingprocessing that is executed by the video processing apparatus 100according to the present exemplary embodiment based on a stored focusposition.

The processes in steps S901 to S906 in FIG. 9 are processes similar tothe respective processes in steps S401 to S406 in FIG. 4 in the firstexemplary embodiment. Thus, the description thereof will be omitted. Inother words, the video processing apparatus 100 according to the presentexemplary embodiment can acquire information indicating a touch positionthat is stored in the storage unit 111, through the processes in stepsS901 to S906. In this example, similarly to the first exemplaryembodiment, information indicating a touch position (the position 606)that is stored in step S306 in FIG. 3 is acquired. In addition, wheninformation indicating a touch position has been read in step S906 ofthe present exemplary embodiment, as illustrated in FIG. 10A, the videoprocessing apparatus 100 displays a cursor at the position 606 on thedisplay screen 601 of the display unit 103 that corresponds to theinformation indicating a touch position. At this time, as illustrated inFIG. 10A, a focus is not put on the actor B since position 1005 of theactor B that has entered the frame is on the outside of thepredetermined range from the touch position (the position 606), unlikethe position of the actor B in rehearsal shooting.

In step S907, the video processing apparatus 100 performs objectdetection based on information indicating a touch position that has beenacquired from the storage unit 111 in step S906. More specifically, inthe present exemplary embodiment, the object detection unit 801 performsobject detection processing in a range within the predetermined distancefrom (in vicinity of) the touch position. In addition, if an object isdetected in step S907 of the present exemplary embodiment, asillustrated in FIG. 10B, on the display screen 601 of the display unit103, the frame 1006 indicating the range in which the object has beendetected is displayed. In addition, FIG. 10B indicates that the actor Bincluded in the frame 1006 has been detected as an object.

In step S908, the video processing apparatus 100 performs focusingprocessing based on an object detection result obtained in step S907.More specifically, the video processing apparatus 100 according to thepresent exemplary embodiment acquires the position of the objectdetected in step S907, and performs focusing processing with respect tothe range that is based on the acquired position. When the focusingprocessing is performed in step S908 according to the present exemplaryembodiment, as illustrated in FIG. 10C, an image in which a focus is puton the position 1005 of the actor B is displayed on the display screen601 of the display unit 103. In addition, the video processing apparatus100 according to the present exemplary embodiment uses the position ofan object as a center point of the object. This corresponds to, forexample, a center point 1007 of the frame 1006 of the detected objectthat is illustrated in FIG. 10B. For example, the video processingapparatus 100 according to the present exemplary embodiment uses thecenter point 1007 of the framer 1006 of the detected object that isillustrated in FIG. 10B, as a position of the object. The position of anobject, however, is not limited to a center point of the object. As aposition of an object, an end point and/or region of the object may beused, or a predetermined point and/or region within a range of theobject may be used.

The process in step S909 and a subsequent process are similar to theprocess in step S408 and a subsequent process in FIG. 4 in the firstexemplary embodiment. Thus, the description thereof will be omitted.

Through the processing in FIG. 9, the video processing apparatus 100according to the present exemplary embodiment can put a focus on theactor B even if a stand position of the actor B in real shooting (theposition 1005 in FIGS. 10A, 10B, and 10C) differs from a positioncorresponding to information indicating a touch position (the position606 in FIGS. 10A, 10B, and 10C). In other words, as illustrated in FIGS.10A, 10B, and 10C, by performing object detection with respect to arange in vicinity of the touch position (the position 606) in step S907,focusing processing is performed with respect to a region on the displayscreen 601 where an object has been detected (the frame 1006). As aresult, the video processing apparatus 100 according to the presentexemplary embodiment can put a focus on the actor B.

In addition, the video processing apparatus 100 according to the presentexemplary embodiment does not perform focusing processing in step S908if the video processing apparatus 100 has failed to detect an object instep S907 in a range that is based on the touch position. Theconfiguration of the video processing apparatus 100 according to thepresent exemplary embodiment, however, is not limited to this. Forexample, if the video processing apparatus 100 has failed to detect anobject in step S907, in step S908, the video processing apparatus 100may perform focusing processing so that a focus is put on a touchposition acquired in step S906. Alternatively, if the video processingapparatus 100 has failed to detect an object in step S907 in a rangethat is based on the touch position, the video processing apparatus 100may perform object detection processing with respect to a range outsidethe range that is based on the touch position. Then, in step S908, thevideo processing apparatus 100 may perform focusing processing so that afocus is put on an object at a position closest to the touch position,among objects detected outside the range that is based on the touchposition.

As described above, according to the video processing apparatus of thepresent exemplary embodiment, a focus can be put on a subject existingat a position different from a prestored focus position.

Furthermore, according to the video processing apparatus of the presentexemplary embodiment, if a prestored touch position on a display screendiffers from a position of a subject, by performing object detectionwith respect to a region in vicinity of a touch position, focusingprocessing can be performed with respect to a subject existing invicinity of the touch position.

In addition, the video processing apparatus of the present exemplaryembodiment can appropriately perform focusing processing even without ameasurement device used for measuring a distance in conventionalfocusing processing.

In addition, when a plurality of touch positions is stored, the videoprocessing apparatus of the present exemplary embodiment canappropriately perform focusing processing by using a touch time, withoutreceiving, from the user, an operation of selecting a touch position tobe used, from among the plurality of touch positions.

Third Exemplary Embodiment

The above-described video processing apparatus 100 according to thefirst exemplary embodiment reads stored information indicating a touchposition and information indicating a touch time, and performs focusingprocessing of putting a focus on the touch position corresponding to thetouch time. In addition, the video processing apparatus 100 according tothe second exemplary embodiment reads stored information indicating atouch position and information indicating a touch time, performs objectdetection within a specific range from the touch position, and performsfocusing processing of putting a focus on a position that is based on adetection result. A video processing apparatus according to a thirdexemplary embodiment reads stored information indicating a touchposition, and displays preannouncement information in a time periodbefore and after a touch time corresponding to the touch position (froma predetermined time before the touch time to a predetermined time afterthe touch time). In addition, the video processing apparatus accordingto the present exemplary embodiment displays, as preannouncementinformation, a cursor corresponding to the touch position, and counterinformation that is based on the touch time.

FIG. 11 is a diagram illustrating a configuration example of a videoprocessing apparatus according to the present exemplary embodiment. Inaddition, in FIG. 11 of the present exemplary embodiment, the unitshaving the same functions as the respective units in FIG. 1 of the firstexemplary embodiment are assigned the same reference numerals, and thedescription thereof will be omitted. The video processing apparatus 100according to the present exemplary embodiment differs from the videoprocessing apparatus 100 according to the first exemplary embodimentthat is illustrated in FIG. 1, in that a preannouncement display controlunit 1101 is included. The preannouncement display control unit 1101performs control so as to display preannouncement information at aposition on the display screen 601 of the display unit 103 thatcorresponds to information indicating a touch position that is stored inthe storage unit 111. Here, the preannouncement display control unit1101 according to the present exemplary embodiment performs control soas to display preannouncement information on the display unit 103 withbeing superimposed on an image output from the display control unit 104.In addition, in the video processing apparatus 100 according to thepresent exemplary embodiment, the preannouncement display control unit1101 performs control so as to display preannouncement information onthe display unit 103. The configuration of the video processingapparatus 100, however, is not limited to this. For example, the displaycontrol unit 104 may have the function of the preannouncement displaycontrol unit 1101.

The video processing apparatus 100 according to the present exemplaryembodiment performs processing for storing an instructed focus position,similarly to the processing in first exemplary embodiment that isillustrated in FIGS. 3 and 6A to 6C. Thus, the description thereof willbe omitted.

Next, the procedure of focusing processing that is executed by the videoprocessing apparatus 100 according to the present exemplary embodimentbased on a stored focus position will be described in detail using FIGS.12 and 13A to 13C. In addition, the video processing apparatus 100according to the present exemplary embodiment executes, in real shooting(the first take of the same shot), the focusing processing that isexecuted based on a stored focus position (processing in FIG. 12). Theexecution timing, however, is not limited to this, and the videoprocessing apparatus 100 is only required to execute the focusingprocessing (processing in FIG. 12) based on a stored focus positionafter the processing for storing an instructed focus position(processing in FIG. 3). For example, if processing for storing aninstructed focus position is executed in the shooting of the first takeof the same shot, the focusing processing that is based on a storedfocus position is executed in the shooting of the second and subsequenttakes. In addition, similarly to the case of performing the processingillustrated in FIG. 3, a shot video is shot using the studio setillustrated in FIG. 5.

First, FIGS. 13A, 13B, and 13C are diagrams illustrating the displayscreen 601 in the display unit 103 of the video processing apparatus 100that is displayed when the focusing processing is performed based on astored focus position. In addition, FIGS. 13A, 1313, and 13C correspondto the lapse of time in shooting. In addition, in FIGS. 13A, 13B, and13C, solid lines represents outlines of in-focus portions, and dottedlines represents outlines of out-of-focus portions.

FIG. 13A illustrates an image displayed in the display unit 103 apredetermined time before (in this example, 3 seconds before) a touchtime. As illustrated in FIG. 13A, on the display screen 601 of thedisplay unit 103, the actor B is displayed at a position 1505, and theentrance 602 and the door 603 opened by the actor B are displayed. Inaddition, on the display screen 601, the actor A is displayed at aposition 1506. Furthermore, as illustrated in FIG. 13A, on the displayscreen 601, a frame indicating preannouncement information is displayedat the position 606 corresponding to information indicating a touchposition. In addition, on the display screen 601, count information 1504is displayed based on a touch time and time information acquired fromthe time measurement unit 110. The count information 1504 is informationindicating a remaining time before a touch time (i.e., a time at whichinformation instructing a focus position has been input in rehearsalshooting). In addition, the count information 1504 according to thepresent exemplary embodiment that is illustrated in FIG. 13A isdisplayed as [−3], and indicates that 3 seconds left before the touchtime.

FIG. 13B illustrates an image displayed in the display unit 103 when thetime acquired from the time measurement unit 110 matches a read touchtime. As illustrated in FIG. 13B, on the display screen 601 of thedisplay unit 103, the actor B is displayed at a position 1508. Inaddition, as illustrated in FIG. 13B, a cursor illustrated at a position1507 corresponds to a touch position newly instructed by the user, andindicates that information instructing the position 1507 as a focusposition has been input. In addition, FIG. 13B illustrates an imagedisplayed at a time matching the read touch time, and in FIG. 13B, thecount information 1504 is displayed as [±0].

FIG. 13C illustrates an image displayed in the display unit 103 afterfocusing processing has been performed (in this example, 1 second afterthe touch time). As illustrated in FIG. 13C, a focus is put on the actorB existing at the position 1508, based on the touch position newlyinstructed by the user.

In addition, the video processing apparatus 100 according to the presentexemplary embodiment may be configured to change a shape or color ofpreannouncement information (the count information 1504 and/or thecursor at the position 606) based on a time acquired from the timemeasurement unit 110 and a touch time. By changing the shape or color inthis manner, preannouncement information of a time period closer to atouch time can be made more noticeable than preannouncement informationof a time period remoter from the touch time, so that display with highvisibility for the user can be performed.

Next, FIG. 12 is a flowchart illustrating the details of the focusingprocessing that is executed by the video processing apparatus 100according to the present exemplary embodiment based on a stored focusposition.

The processes in steps S1201 to S1204 in FIG. 12 are the same processesas the respective processes in S401 to S404 in FIG. 4 in the firstexemplary embodiment. Thus, the description thereof will be omitted. Inother words, in step S1201, the video processing apparatus 100 accordingto the present exemplary embodiment acquires a touch time stored in thestorage unit 111, and in step S1204, the video processing apparatus 100starts time measurement in response to the detection of an action call.

In step S1205, the video processing apparatus 100 determines whether atime measured by the time measurement unit 110 is a time a predeterminedtime before (X seconds before) the touch time acquired from the storageunit 111 in step S1201. In this step, if it is determined that a timemeasured by the time measurement unit 110 is a time a predetermined timebefore the touch time (YES in step S1205), the video processingapparatus 100 advances the processing to step S1206. On the other hand,if it is determined that a time measured by the time measurement unit110 is not a time a predetermined time before the touch time (No in stepS1205), the video processing apparatus 100 advances the processing tostep S1212.

In step S1206, the video processing apparatus 100 acquires informationindicating a touch position that is stored in the storage unit 111. Morespecifically, the coordinate acquisition unit 113 according to thepresent exemplary embodiment acquires (reads, calls) informationindicating a touch position that is saved in the storage unit 111. Inaddition, the information indicating a touch position is informationindicating a touch position that corresponds to information indicating atouch time that has been acquired in step S1201. In this example, thetouch position (the position 606) stored in step S306 in FIG. 3 isacquired.

In step S1207, the video processing apparatus 100 starts preannouncementdisplay of a touch position. More specifically, the video processingapparatus 100 displays preannouncement information at the touch positionacquired in step S1201, and displays information indicating a remainingtime before a touch time (count information). For example, when thevideo processing apparatus 100 of the present exemplary embodiment hasacquired information indicating a touch position in step S1206, asillustrated in FIG. 13A, the video processing apparatus 100 displays aframe indicating preannouncement information, at the position 606 on thedisplay screen 601 of the display unit 103 that corresponds to theinformation indicating a touch position. In addition, the videoprocessing apparatus 100 displays the count information 1504 based on atouch time acquired from the storage unit 111 and a time acquired fromthe time measurement unit 110. In this example, the count information1504 is information indicating a remaining time before a touch time(i.e., a time at which information instructing a focus position has beeninput in rehearsal shooting) in a countdown style.

In step S1208, the video processing apparatus 100 determines whetherinformation instructing a focus position has been acquired (detected).If the display unit 103 receives, from the user, an operation. (e.g.,touch operation) of instructing a new focus position, the input unit 106according to the present exemplary embodiment acquires informationinstructing a focus position. If information instructing a focusposition has been newly detected (touch operation has been detected) instep S1208 (YES in step S1208), the video processing apparatus 100advances the processing to step S1209. On the other hand, if informationinstructing a focus position has not been newly detected (No in stepS1208), the video processing apparatus 100 advances the processing tostep S1210.

In step S1209, the video processing apparatus 100 performs focusingprocessing based on the information instructing a focus position thathas been acquired in step S1208. More specifically, the input unit 106according to the present exemplary embodiment outputs the informationinstructing a focus position, to the instruction unit 105. Then, basedon the information instructing a focus position, the instruction unit105 acquires information indicating a touch position, and outputs theacquired information to the focus control unit 107. Then, based on theinformation indicating a touch position, the focus control unit 107performs processing for driving the lens 102 so that a focus is put onthe touch position (focusing processing).

In addition, if information instructing a focus position has beenacquired in step S1208 of the present exemplary embodiment, asillustrated in FIG. 13B, a cursor is displayed at the instructedposition 1507 on the display screen 601 of the display unit 103. Byreferring to the cursor displayed at the position 606 corresponding tothe touch position instructed in rehearsal shooting, the user canpredict a position on which focusing processing for the actor B is to beperformed. Furthermore, by referring to the displayed count information1504, the user can predict a timing at which focusing processing is tobe performed. In addition, after focusing processing has been performedin step S1209, as illustrated in FIG. 13C, a focus is put on the actor Bexisting at the position 1508. In addition, the count information 1504is continuously displayed. The count information 1504 and the cursorcorresponding to the touch position continue to be displayed until it isdetermined in step S1210 of the present exemplary embodiment that apredetermined time has elapsed from the touch time. Thus, the user canrecognize an elapsed time. Then, even if a timing at which the actor Breaches the position 606 corresponding to the touch position is delayed,the user can predict a position on which focusing processing is to beperformed.

In step S1210, the video processing apparatus 100 determines whether atime measured by the time measurement unit 110 indicates a predeterminedtime after (Y seconds after; X and Y may be the same or may bedifferent) the touch time acquired from the storage unit 111 in stepS1201. In this step, if it is determined that a time measured by thetime measurement unit 110 indicates the predetermined time after thetouch time (YES in step S1210), the video processing apparatus 100advances the processing to step S1211. On the other hand, if it isdetermined that a time measured by the time measurement unit 110 doesnot indicate the predetermined time after the touch time (No in stepS1210), the video processing apparatus 100 returns to the processing instep S1208.

In step S1211, the video processing apparatus 100 ends preannouncementdisplay of a touch position. More specifically, the video processingapparatus 100 ends display of preannouncement information for the touchposition acquired in step S1201, and information indicating a time thatis based on the touch time (count information). After the processing instep S1211, the video processing apparatus 100 then advances theprocessing to step S1212.

In step S1212, the video processing apparatus 100 determines whethershooting has ended. If it is determined that shooting has ended (YES instep S1212), the video processing apparatus 100 ends the focusingprocessing that is based on a stored touch position. If it is determinedthat shooting has not ended (NO in step S1212), the video processingapparatus 100 returns to the processing in step S1205.

Through the processing in FIG. 12, the video processing apparatus 100according to the present exemplary embodiment can put a focus on theactor B even if a stand position of the actor B in real shooting (theposition 1508 in FIGS. 13B and 13C) differs from a positioncorresponding to information indicating a touch position (the position606 in FIGS. 13A, 13B, and 13C). More specifically, as illustrated inFIGS. 13A, 13B, and 13C, in step S1207, preannouncement information (thecursor at the position 606 and the count information 1504) is displayed.Then, after checking the preannouncement information, the user performsan operation of instructing a new focus position. Through theprocessing, the video processing apparatus 100 according to the presentexemplary embodiment performs focusing processing with respect to thenewly instructed focus position. As a result, the video processingapparatus 100 can put a focus on the actor B.

In addition, if the video processing apparatus 100 according to thepresent exemplary embodiment has not newly detected informationinstructing a focus position in step S1208 illustrated in FIG. 12 (No instep S1208), the video processing apparatus 100 advances the processingto step S1210. The configuration of the video processing apparatus 100,however, is not limited to this. For example, as illustrated in FIG. 15,if the video processing apparatus 100 has not newly detected informationinstructing a focus position (No in step S1208), the video processingapparatus 100 may advance the processing to the processing in stepS1213. In step S1213, the video processing apparatus 100 determineswhether the time acquired from the time measurement unit 110 indicatesthat the touch time acquired from the storage unit 111 has passed, andfocusing processing has never been performed. Then, if it is determinedin step S1213 that the touch time has passed, and focusing processinghas never been performed (YES in step S1213), the video processingapparatus 100 advances the processing to step S1214. Then, in stepS1214, the video processing apparatus 100 may perform focusingprocessing based on the touch position read (stored) in step S1206. Inaddition, if it is determined in step S1213 that the touch time has notpassed, and/or focusing processing has ever been performed once or more(No in step S1213), the video processing apparatus 100 returns to theprocessing in step S1208. By performing the processes in steps S1213 toS1214 in FIG. 15 in this manner, the video processing apparatus 100 canperform focusing processing that is based on a stored touch position anda touch time, when the stored touch position matches a position of asubject, for example.

As described above, according to the video processing apparatus of thepresent exemplary embodiment, a focus can be put on a subject existingat a position different from a prestored focus position.

Furthermore, according to the video processing apparatus of the presentexemplary embodiment, a prestored touch position and a touch time can bedisplayed. More specifically, the video processing apparatus of thepresent exemplary embodiment can display preannouncement information(cursor corresponding to a touch position and counter information thatis based on a touch time) based on a stored touch position and a touchtime. With this configuration, the video processing apparatus of thepresent exemplary embodiment can assist the user to perform an operationof instructing a new focus position, in performing shooting again (inreal shooting). For example, even if a stand position and an operationtiming of an actor serving as a subject differ from a touch position anda touch time, the user can easily instruct a focus position again bychecking displayed preannouncement information.

In addition, the video processing apparatus of the present exemplaryembodiment can appropriately perform focusing processing even without ameasurement device used for measuring a distance in conventionalfocusing processing.

In addition, when a plurality of touch positions is stored, the videoprocessing apparatus of the present exemplary embodiment canappropriately perform focusing processing by using a touch time, withoutreceiving, from the user, an operation of selecting a touch position tobe used, from among the plurality of touch positions.

In addition, the video processing apparatus according to the presentexemplary embodiment is configured to display only preannouncementinformation in real take shooting. Nevertheless, the video processingapparatus may have a configuration obtained by combining the presentexemplary embodiment with the above-described first exemplary embodimentand/or the second exemplary embodiment. With this configuration, if itis determined that focusing processing is to be performed with respectto a position corresponding to touch position and at a timecorresponding to a touch time that are stored in rehearsal shooting, thevideo processing apparatus does not have to receive informationinstructing a new focus position. In other words, with thisconfiguration, the video processing apparatus may perform focusingprocessing that is based on a stored touch position and a touch time,when the stored touch position matches a position of a subject, forexample.

Fourth Exemplary Embodiment

In the above-described exemplary embodiments, the description has beengiven assuming that the units illustrated in FIGS. 1, 8, and 11 areformed by hardware components. Nevertheless, the processes illustratedin FIGS. 3, 4, 9, 12, and 15 that are performed by part or all of theunits illustrated in FIGS. 1, 8, and 11 may be implemented by computerprograms. The present exemplary embodiment will be described below usingFIG. 14. FIG. 14 is a block diagram illustrating a hardwareconfiguration example of a computer applicable to the video processingapparatus according to each of the above-described exemplaryembodiments.

A central processing unit (CPU) 1401 controls the entire computer usingcomputer programs and data stored in a random access memory (RAM) 1402and a read-only memory (ROM) 1403, and executes each of theabove-described processes in such a manner that the video processingapparatus according to each of the above-described exemplary embodimentsperforms the processes. In other words, the CPU 1401 functions as eachprocessing unit illustrated in FIGS. 1, 8, and 11.

The RAM 1402 has an area for temporarily storing computer programs anddata that have been loaded from an external storage device 1406, dataacquired from the outside via an interface (I/F) 1407, and the like.Furthermore, the RAM 1402 has a work area to be used when the CPU 1401executes various types of processing. In other words, the RAM 1402 canbe allocated as, for example, an image memory (specific memory), or canappropriately provide other various areas.

The ROM 1403 stores setting data of this computer, boot programs, andthe like. An operation unit 1404 includes a keyboard, a mouse, and thelike. The operation unit 1404 can input various instructions to the CPU1401 by being operated by the user of this computer. An output unit 1405outputs a processing result obtained by the CPU 1401. In addition, theoutput unit 1405 is formed by, for example, a liquid crystal display,and displays a processing result obtained by the CPU 1401.

The external storage device 1406 is a large-capacity information storagedevice typified by a hard disc drive. An operating system (OS) andcomputer programs for causing the CPU 1401 to implement the functions ofthe units illustrated in FIG. 14 are saved in the external storagedevice 1406. Furthermore, each piece of image data to be processed maybe saved in the external storage device 1406.

The computer programs and data that are saved in the external storagedevice 1406 are appropriately loaded into the RAM 1402 according to thecontrol by the CPU 1401, to be processed by the CPU 1401. Networks suchas a local area network (LAN) and the Internet, and other devices suchas a projection device and a display device can be connected to the I/F1407. This computer can acquire and transmit various types ofinformation via the I/F 1407. A bus 1408 is a bus for connecting theabove-described units.

In the operations having the above-described configuration, the CPU 1401plays a central role in controlling the operations described withreference to the above-described flowcharts.

Other Exemplary Embodiments

In each of the above-described first to third exemplary embodiments, theunits of the video processing apparatus are included in a singleapparatus. The configuration of the video processing apparatus, however,is not limited to this. In other words, part of the units of the videoprocessing apparatus may be included in another apparatus. For example,the display unit 103 and the input unit 106 of the video processingapparatus 100 may be included in another device (e.g., tablet device),and the other device and the video processing apparatus 100 may beconnected in a wired or wireless manner. Alternatively, the lens 102 andthe microphone 108 of the video processing apparatus 100 may be includedin another apparatus (e.g., imaging apparatus), and the other apparatusand the video processing apparatus 100 may be connected in a wired orwireless manner.

In addition, in each of the above-described first to third exemplaryembodiments, the video processing apparatus reads stored informationindicating a touch position, and displays a cursor at a position on adisplay screen that corresponds to the touch position. In addition, ineach of the above-described first to third exemplary embodiments, ifthere is a plurality of stored touch positions, the video processingapparatus may display cursors corresponding to the plurality of touchpositions, using different methods. For example, in a video, when afocus position is first instructed, a first touch position and a firsttouch time are stored, and when a new focus position is instructed next(secondly), a second touch position and a second touch time are stored.In such a case, the video processing apparatus may display a cursorcorresponding to the first touch position and a cursor corresponding tothe second touch position, using different display modes (e.g.,different colors).

According to the above-described various exemplary embodiments, inaccordance with a focus position instructed in the first shooting and atiming at which the instruction has been received, a focus position canbe controlled in the second shooting. In other words, positioninformation and time information that are based on a focus positioninstructed in the first shooting are acquired, and a focus position canbe controlled in the second shooting using these pieces of information.Consequently, in the second shooting, a focus position can be controlledwhile being adjusted in view of a shift from a position on which a focushas been put in the first shooting and a shift from a time at which thefocus has been put. Thus, a focus can be easily put on a desiredposition at a desired timing.

The present invention can also be implemented by the followingprocessing. More specifically, a program for implementing 1 or morefunctions of the above-described exemplary embodiments is supplied to asystem or an apparatus via a network or a storage medium, and 1 or moreprocessors in a computer of the system or the apparatus reads andexecutes the program. In addition, the present invention can also beimplemented by a circuit for implementing 1 or more functions (e.g.,application specific integrated circuits (ASIC)).

According to each of the above-described exemplary embodiments, inaccordance with a focus position instructed in the first shooting and atiming at which the instruction has been received, a focus position canbe controlled in the second shooting.

Other Embodiments

Embodiment(s) of the present invention can also be realized by acomputer of a system or apparatus that reads out and executes computerexecutable instructions (e.g., one or more programs) recorded on astorage medium (which may also be referred to more fully as a‘non-transitory computer-readable storage medium’) to perform thefunctions of one or more of the above-described embodiment(s) and/orthat includes one or more circuits (e.g., application specificintegrated circuit (ASIC)) for performing the functions of one or moreof the above-described embodiment(s), and by a method performed by thecomputer of the system or apparatus by, for example, reading out andexecuting the computer executable instructions from the storage mediumto perform the functions of one more of the above-describedembodiment(s) and/or controlling the one or more circuits to perform thefunctions of one or more of the above-described embodiment(s). Thecomputer may comprise one or more processors (e.g., central processingunit (CPU), micro processing unit (MPU)) and may include a network ofseparate computers or separate processors to read out and execute thecomputer executable instructions. The computer executable instructionsmay be provided to the computer, for example, from a network or thestorage medium. The storage medium may include, for example, one or moreof a hard disk, a random-access memory (RAM), a read only memory (ROM),a storage of distributed computing systems, an optical disk (such as acompact disc (CD), digital versatile disc (DVD), or Blu-ray Disc (BD)™),a flash memory device, a memory card, and the like.

While the present invention has been described with reference toexemplary embodiments, it is to be understood that the invention is notlimited to the disclosed exemplary embodiments. The scope of thefollowing claims is to be accorded the broadest interpretation so as toencompass all such modifications and equivalent structures andfunctions.

This application claims the benefit of Japanese Patent Application No.2015-140045, filed Jul. 13, 2015, which is hereby incorporated byreference herein in its entirety.

What is claimed is:
 1. A video processing apparatus comprising: an inputunit configured to input information instructing a focus position, thefocus position being a position on which a focus is to be put, in afirst shooting; an acquisition unit configured, in a case in which theinformation instructing a focus position has been input by the inputunit, to acquire position information indicating the instructed focusposition, and time information indicating a time at which the focusposition has been instructed; and a control unit configured to control afocus position in a second shooting based on the position informationand the time information that have been acquired by the acquisitionunit.
 2. The video processing apparatus according to claim 1, furthercomprising: a detection unit configured to detect predeterminedidentification information; and a measurement unit configured to measurea time from a time point at which the predetermined identificationinformation has been detected by the detection unit, wherein the controlunit performs control so as to put a focus on a position that is basedon the position information acquired by the acquisition unit, accordingto the time measured by the measurement unit and the time informationacquired by the acquisition unit.
 3. The video processing apparatusaccording to claim 2, further comprising a second detection unitconfigured to detect an object, wherein the second detection unitdetects an object existing in a region that is based on the positioninformation acquired by the acquisition unit, according to the timemeasured by the measurement unit and the time information acquired bythe acquisition unit, and wherein the control unit performs control soas to put a focus on a position that is based on a detection resultobtained by the second detection unit.
 4. The video processing apparatusaccording to claim 3, wherein the second detection unit detects anobject existing in a region within a predetermined distance from aposition that is based on the position information acquired by theacquisition unit, according to the time measured by the measurement unitand the time information acquired by the acquisition unit, and whereinthe control unit performs control so as to put a focus on a position ofthe object detected by the second detection unit.
 5. The videoprocessing apparatus according to claim 1, further comprising a storageunit configured to store the position information and the timeinformation that have been acquired by the acquisition unit, inassociation with each other, wherein the control unit adjusts a focusbased on the position information and the time information that arestored in the storage unit.
 6. The video processing apparatus accordingto claim 1, further comprising: a second acquisition unit configured toacquire a video acquired by the second shooting; and a detection unitconfigured to detect predetermined identification information; ameasurement unit configured to measure a time from a time point at whichthe predetermined identification information has been detected by thedetection unit; and a display control unit configured to perform controlso as to display a graphic corresponding to the position informationacquired by the acquisition unit, with being superimposed on the videoacquired by the second acquisition unit, based on the time measured bythe measurement unit and the time information acquired by theacquisition unit.
 7. The video processing apparatus according to claim6, wherein the display control unit performs control so as to displayinformation indicating a difference between the time measured by themeasurement unit and the time information acquired by the acquisitionunit, and the graphic with being superimposed on the video acquired bythe second acquisition unit.
 8. The video processing apparatus accordingto claim 6, wherein the display control unit performs control so as todisplay the graphic at a position corresponding to the positioninformation.
 9. A video processing method comprising: an input step ofinputting information instructing a focus position, in a first shooting;an acquisition step of, in a case in which the information instructing afocus position has been input by the input step, acquiring positioninformation indicating the instructed focus position, and timeinformation indicating a time at which the focus position has beeninstructed; and a control step of controlling a focus position in asecond shooting based on the position information and the timeinformation that have been acquired by the acquisition step.
 10. Acomputer-readable non-transitory recording medium storing a program forcausing a computer to function as: an input unit configured to inputinformation instructing a focus position, in a first shooting; anacquisition unit configured, in a case in which the informationinstructing a focus position has been input by the input unit, toacquire position information indicating the instructed focus position,and time information indicating a time at which the focus position hasbeen instructed; and a control unit configured to control a focusposition in a second shooting based on the position information and thetime information that have been acquired by the acquisition unit.